NSF PAR Search | NSF Public Access Repository

Note: When clicking on a Digital Object Identifier (DOI) number, you will be taken to an external site maintained by the publisher. Some full text articles may not yet be available without a charge during the embargo (administrative interval).
What is a DOI Number?

Some links on this page may take you to non-federal websites. Their policies may differ from this site.

FreDDI: Frequency-Driven DNN Partitioning in Distributed Inference

https://doi.org/10.1109/SMARTCOMP65954.2025.00106

Viramontes, Robert; Davoodi, Azadeh (June 2025, IEEE)

Free, publicly-accessible full text available June 16, 2026
ReBERT: LLM for Gate-Level to Word-Level Reverse Engineering

https://doi.org/10.23919/DATE64628.2025.10993097

Zhang, Lizi; Davoodi, Azadeh; Topaloglu, Rasit Onur (March 2025, IEEE)

Free, publicly-accessible full text available March 31, 2026
CADI: Carbon-Aware Distributed Inference

https://doi.org/10.1109/COINS65080.2025.11125725

Viramontes, Robert; Davoodi, Azadeh (August 2025, IEEE)

Free, publicly-accessible full text available August 4, 2026
Static IR Drop Prediction with Limited Data from Real Designs

https://doi.org/10.1145/3658617.3697592

Zhang, Lizi; Davoodi, Azadeh (January 2025, ACM)

Free, publicly-accessible full text available January 20, 2026
Efficient and Effective Neural Networks for Automatic Test Pattern Generation

https://doi.org/10.1145/3670474.3685939

Zhang, Lizi; Davoodi, Azadeh (September 2024, ACM)

Full Text Available
DIME: Distributed Inference Model Estimation for Minimizing Profiled Latency

https://doi.org/10.1109/SMARTCOMP61445.2024.00081

Viramontes, Robert; Davoodi, Azadeh (June 2024, IEEE)

Full Text Available
Neural Network Partitioning for Fast Distributed Inference

https://doi.org/10.1109/ISQED57927.2023.10129343

Viramontes, Robert; Davoodi, Azadeh (April 2023, IEEE)

The rising availability of heterogeneous networked devices highlights new opportunities for distributed artificial intelligence. This work proposes an Integer Linear Programming (ILP) optimization scheme to assign layers of a neural network in a distributed setting with heterogeneous devices representing edge, hub, and cloud in order to minimize the overall inference latency. The ILP formulation captures the tradeoff between avoiding communication cost when executing consecutive layers on the same device versus the latency benefit due to weight preloading when an idle device is waiting to receive the results of an earlier layer across the network. In our experiments we show the layer assignment and inference latency of a neural network can significantly vary depending on the types of devices in the network and their communications bandwidths.
more » « less
Full Text Available
ObfusX: Routing obfuscation with explanatory analysis of a machine learning attack

https://doi.org/10.1016/j.vlsi.2022.10.013

Zeng, Wei; Davoodi, Azadeh; Topaloglu, Rasit Onur (March 2023, Integration)

Full Text Available
$$\text{Edge}^{n}$$ AI: Distributed Inference with Local Edge Devices and Minimal Latency

https://doi.org/10.1109/ASP-DAC52403.2022.9712496

Hemmat, Maedeh; Davoodi, Azadeh; Hu, Yu Hen (January 2022, EdgenAI: Distributed Inference with Local Edge Devices and Minimal Latency)

Full Text Available
Lorax: Machine Learning-Based Oracle Reconstruction With Minimal I/O Patterns

https://doi.org/10.1109/ISVLSI51109.2021.00033

Zeng, Wei; Davoodi, Azadeh; Topaloglu, Rasit Onur (July 2021, 2021 IEEE Computer Society Annual Symposium on VLSI (ISVLSI))

Full Text Available

« Prev Next »

Search for: All records